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2 .input device 

210 .. oocument.tob6-burveyed <j condit ion input unit 
230... extract ch condition ano otmeh input unit 
230 .. documents- t04e compared p condition input unit 
i— processing device 

110 .. DOCUMENT-TO-BE -SURVEYED <l READ OUT UNIT 
1 20... INDEX ENTRY WORD <S) EXTRACTION UNIT 

130 .. OCCUNCNTS- TO-BE COMPARED P READ OUT UNIT 
140 . INDEX ENTRY WORD IP) EXTRACTION UNI I 

131 . TF (d) CALCULATION UNfT 
Ml... TF(P1 CALCULATION UNIT 
142 . OF (P) CALCULATION UNIT 
ISO .. SIMlARrTY CALCULAHON UNIT 

160... SMO-AR DOCUMENTS S SELECTION UNIT 
1 70 .. INDEX ENTRY WORD (8) EXTRACTION UNIT 
1 7 1... IDF (S) CALCULATION UNIT 

180 .. CHARACTERISTIC INDEX ENTRY WORD EXTRACTION UNIT 
4... OUTPUT DEVICE 

410 .. MAP CREATION CONDITION READ OUT UNM 

412-.. MAP DATA ACQUlSmOM UNIT 

430 . L 1ST OUTPUT CONDITION READ OUT UNIT 

432 .LIST DATA ACQUSmON UNIT 

430 .. COMMENT ADDITION CONDinON READ OUT UMT 

432 .. COMMENT ADDITION UNIT 

440 .. MAfVLISftCOMMENT COMPLEX OUTPUT UNIT 

3 .. HECORDMG DEVICE 

310... CONDITION RECORDING UNIT 

330 .. WORK RESULT STORAGE UNIT 

330 .. DOCUMENT STORAGE UNIT 
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(57) Abstract: An index entry word extraction device includes: input means (I) 
for inputting a document-to-be-surveyed d and documents -to-be- compared P; index 
entry word extraction means (120) for extracting an index entry word from the 
document-to-be-surveyed d; first appearance frequency calculation means (142) for 
calculating a function value IDF (P) of the appearance frequency of the extracted index 
entry word in the documents- to-be -compared P; similar documents selecting means 
(160) for selecting similar documents S similar to the document-to-be-surveyed d in the 
documents-to-be-compared P according to the data on the document-to-be-surveyed 
d; second appearance frequency calculation means (171) for calculating the function 
value IDF (S) of the appearance frequency of the extracted index entry word in the 
similar documents S; and output means (4) for outputting each index entry and its 
positioning data according to the combination of the function values of the respective 
appearance frequencies in the documents -to -be -compared and the similar documents 
which have been calculated. Thus, it is possible to accurately grasp the feature of the 
document-to-be-surveyed. 
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